A Novel Spatio-Temporal-Wise Network for Action Recognition
نویسندگان
چکیده
Action recognition is a challenging task that requires understanding the temporal relationships between frames. However, capturing and processing spatio-temporal motion features computationally expensive, making it difficult to apply practical situations. We propose novel approach called Spatio-Temporal-Wise (STW) network address this problem. The STW inserts blocks, consisting of Spatio-Temporal Fusion Module Temporal-Wise Module, into an existing 2D CNN. This very little additional computational overhead but brings huge performance improvements in recognizing human actions. proposed method evaluated on several public datasets, including Something-Something v1 & v2, Kinetics-400, UCF101, HMDB51. achieved comparable or better these datasets compared state-of-the-art methods. Notably, improves accuracy by 26.6% 34.6% v2 respectively, with less than 2% overhead. results demonstrate can significantly improve action tasks while requiring only small overhead, which represents promising direction for developing more efficient effective approaches handling reasoning recognition, may have important applications future.
منابع مشابه
Deep Spatio-temporal Manifold Network for Action Recognition
Visual data such as videos are often sampled from complex manifold. We propose leveraging the manifold structure to constrain the deep action feature learning, thereby minimizing the intra-class variations in the feature space and alleviating the over-fitting problem. Considering that manifold can be transferred, layer by layer, from the data domain to the deep features, the manifold priori is ...
متن کاملSpatio-temporal SURF for Human Action Recognition
In this paper, we propose a new spatio-temporal descriptor called ST-SURF. The latter is based on a novel combination between the speed up robust feature and the optical flow. The Hessian detector is employed to find all interest points. To reduce the computation time, we propose a new methodology for video segmentation, in Frames Packets FPs, based on the interest points trajectory tracking. W...
متن کاملHuman Action Recognition Using Spatio-temporal Classification
In this paper a framework “Temporal-Vector Trajectory Learning” (TVTL) for human action recognition is proposed. In this framework, the major concept is that we would like to add the temporal information into the action recognition process. Base on this purpose, there are three kinds of temporal information, LTM, DTM, and TTM, being proposed. With the three kinds of proposed temporal informatio...
متن کاملSpatio-temporal Aware Non-negative Component Representation for Action Recognition
This paper presents a novel mid-level representation for action recognition, named spatio-temporal aware non-negative component representation (STANNCR). The proposed STANNCR is based on action component and incorporates the spatial-temporal information. We first introduce a spatial-temporal distribution vector (STDV) to model the distributions of local feature locations in a compact and discri...
متن کاملImproved Spatio-temporal Salient Feature Detection for Action Recognition
Spatio-temporal salient features localize the local motion events and are used to represent video sequences for many computer vision tasks such as action recognition. The robust detection of these features under geometric variations such as affine transformation and view/scale changes is however an open problem. Existing methods use the same filter for both time and space and hence, perform an ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Access
سال: 2023
ISSN: ['2169-3536']
DOI: https://doi.org/10.1109/access.2023.3274542